33 research outputs found

    Preface

    Proceedings of the Workshop CHAT 2011: Creation, Harmonization and Application of Terminology Resources. Editors: Tatiana Gornostay and Andrejs Vasiļjevs. NEALT Proceedings Series, Vol. 12 (2011), v-vi. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/16956

    Proceedings

    Proceedings of the Workshop CHAT 2011: Creation, Harmonization and Application of Terminology Resources. Editors: Tatiana Gornostay and Andrejs Vasiļjevs. NEALT Proceedings Series, Vol. 12 (2011). © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/16956

    Development of Text-To-Speech System for Latvian

    Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit. University of Tartu, Tartu, 2007. ISBN 978-9985-4-0513-0 (online) ISBN 978-9985-4-0514-7 (CD-ROM) pp. 67-72

    Consolidation of Heterogeneous Terminology Resources

    Andrejs Vasiļjevs. The electronic version does not contain the appendices. This thesis investigates data management and accessibility problems in the terminology domain, proposing new approaches to building data systems and to computerized information processing. It demonstrates theoretically and practically that the accessibility and usability problems caused by the fragmentation and heterogeneity of terminology resources can be effectively solved with a federated multilingual database system that provides consolidated data representation and integration in authoring software. A unified consolidation methodology is proposed, covering all major aspects from scenario-based requirements analysis, data modeling, storage and representation to the federation of autonomous databases. The thesis introduces a new concept, terminology entry compounding, for identifying and merging semantically matching multilingual terms from different resources. An automated corpus-based context analysis method is proposed for term sense disambiguation in entry compounding.
Keywords: computing, terminology, federated databases, term compounding, terminology consolidation

    From Terminology Database to Platform for Terminology Services

    Proceedings of the Workshop CHAT 2011: Creation, Harmonization and Application of Terminology Resources. Editors: Tatiana Gornostay and Andrejs Vasiļjevs. NEALT Proceedings Series, Vol. 12 (2011), 16-21. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/16956

    Terminology localization guidelines for the national scenario

    This is a preprint of the paper accepted to LREC 2014, the 9th edition of the Language Resources and Evaluation Conference, scheduled for May 28-30, 2014 in Reykjavik (Iceland). This paper presents a set of principles and practical guidelines for terminology work in the national scenario to ensure a harmonized approach to term localization. These linguistic principles and guidelines were elaborated by the Terminology Commission in Latvia for the domain of Information and Communication Technology (ICT). We also present a novel approach to the corpus-based selection and evaluation of the most frequently used terms. Analysis of the terms shows that, in general, localized terms in normative terminology work in Latvia are coined according to these guidelines. We further evaluate how terms included in the database of official terminology are adopted in general use, such as newspaper articles, blogs, forums and websites. Our evaluation shows that in a non-normative context the official terminology faces strong competition from other variants of localized terms. Conclusions and recommendations from a lexical analysis of localized terms are provided. We hope that the presented guidelines and evaluation approach will be useful to terminology institutions, regulatory authorities and researchers in different countries who are involved in national terminology work. The research leading to these results has received funding from the research project “Optimization methods of large scale statistical models for innovative machine translation technologies” of the European Regional Development Fund, contract no. 2013/0038/2DP/2.1.1.1.0/13/APIA/VIAA/029

    Comprehension Assistant for Languages of Baltic States

    Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit. University of Tartu, Tartu, 2007. ISBN 978-9985-4-0513-0 (online) ISBN 978-9985-4-0514-7 (CD-ROM) pp. 167-174

    Open Terminology Management and Sharing Toolkit for Federation of Terminology Databases

    Consolidated access to current and reliable terms from different subject fields and languages is necessary for content creators and translators. Terminology is also needed in AI applications such as machine translation, speech recognition, information extraction, and other natural language processing tools. In this work, we facilitate standards-based sharing and management of terminology resources by providing an open terminology management solution, the EuroTermBank Toolkit. It allows organisations to manage and search their terms, create term collections, and share them within and outside the organisation by participating in the network of federated databases. The data curated in the federated databases are automatically shared with EuroTermBank, the largest multilingual terminology resource in Europe, allowing translators and language service providers as well as researchers and students to access terminology resources in their most current version. Comment: LREC 202

    Usage Habits of Information Technology Terms in Public Communication (Informācijas tehnoloģijas terminu lietošanas paradumi publiskajā saziņā)

    Information and communication technology (ICT) terms are mainly created in English and then localized into other languages. Because languages differ in morphology and in term-formation traditions, such localization tends to be rather chaotic. Latvian ICT term localizers have developed a fairly strict, so-called quasi-algorithmic approach, which they have tested over more than 15 years. This paper demonstrates the viability of the approach using the most frequently used terms as examples. It rejects the widespread view that localized ICT terms pollute the Latvian language with foreign words. It also analyzes the usability of officially approved terms in texts and the reasons why they sometimes meet resistance from everyday users.

    META-NORD: Baltic and Nordic Branch of the European Open Linguistic Infrastructure

    Proceedings of the NODALIDA 2011 Workshop Visibility and Availability of LT Resources. Editors: Sjur Nørstebø Moshagen and Per Langgård. NEALT Proceedings Series, Vol. 13 (2011), 18–22. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/1697